RFRCDB-siRNA: Improved design of siRNAs by random forest regression model coupled with database searching

Authors

  • Peng Jiang
  • Haonan Wu
  • Yao Da
  • Fei Sang
  • Jiawei Wei
  • Xiao Sun
  • Zuhong Lu
Abstract

Although observations of the factors that influence siRNA efficacy give clues to the mechanism of RNAi, quantitative prediction of siRNA efficacy is still a challenging task. In this paper, we introduced a novel non-linear regression method, random forest regression (RFR), to quantitatively estimate siRNA efficacy values. Compared with an alternative machine learning regression algorithm, support vector machine regression (SVR), and four other score-based algorithms [A. Reynolds, D. Leake, Q. Boese, S. Scaringe, W.S. Marshall, A. Khvorova, Rational siRNA design for RNA interference, Nat. Biotechnol. 22 (2004) 326-330; K. Ui-Tei, Y. Naito, F. Takahashi, T. Haraguchi, H. Ohki-Hamazaki, A. Juni, R. Ueda, K. Saigo, Guidelines for the selection of highly effective siRNA sequences for mammalian and chick RNA interference, Nucleic Acids Res. 32 (2004) 936-948; A.C. Hsieh, R. Bo, J. Manola, F. Vazquez, O. Bare, A. Khvorova, S. Scaringe, W.R. Sellers, A library of siRNA duplexes targeting the phosphoinositide 3-kinase pathway: determinants of gene silencing for use in cell-based screens, Nucleic Acids Res. 32 (2004) 893-901; M. Amarzguioui, H. Prydz, An algorithm for selection of functional siRNA sequences, Biochem. Biophys. Res. Commun. 316 (2004) 1050-1058], our RFR model achieved the best performance of all. A web server, RFRCDB-siRNA (http://www.bioinf.seu.edu.cn/siRNA/index.htm), has been developed. RFRCDB-siRNA consists of two modules: a siRNA-centric database and an RFR prediction system. RFRCDB-siRNA works as follows: (1) Instead of directly predicting the gene silencing activity of siRNAs, the service takes these siRNAs as queries to search against the siRNA-centric database. Matched sequences whose functionality value exceeds the user-defined threshold are kept. (2) The mismatched sequences are then passed to the RFR prediction system for further analysis.
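
The two-step workflow described in the abstract (database lookup first, prediction only for unmatched sequences) can be sketched roughly as follows. This is a minimal illustration, not the server's actual implementation: the function names `design_sirnas` and `gc_fraction`, the exact-match dictionary lookup, and the example sequences and values are all assumptions made for the sketch. In particular, `gc_fraction` is only a placeholder occupying the slot where a trained RFR model would go.

```python
from typing import Callable, Dict, List, Tuple

# (sequence, efficacy value, source of the value)
Result = Tuple[str, float, str]


def gc_fraction(seq: str) -> float:
    """Hypothetical stand-in predictor: fraction of G/C bases.

    This is NOT the paper's RFR model; it only fills the predictor slot
    so that the workflow can be run end to end.
    """
    return sum(base in "GC" for base in seq.upper()) / len(seq)


def design_sirnas(
    candidates: List[str],
    database: Dict[str, float],
    threshold: float,
    predict: Callable[[str], float],
) -> List[Result]:
    """Two-step selection: database search, then prediction for misses.

    Assumption: a "match" is an exact sequence match in a dict that maps
    sequences to known functionality values.
    """
    kept: List[Result] = []
    predicted: List[Result] = []
    for seq in candidates:
        if seq in database:
            value = database[seq]
            # Step 1: keep matched sequences above the user-defined threshold.
            if value > threshold:
                kept.append((seq, value, "database"))
        else:
            # Step 2: unmatched sequences go to the prediction system.
            predicted.append((seq, predict(seq), "predicted"))
    return kept + predicted


if __name__ == "__main__":
    # Made-up sequences and values, for illustration only.
    known = {
        "AAGCUGACCCUGAAGUUCAUC": 0.9,
        "GGGGGGGGGGGGGGGGGGGGG": 0.2,
    }
    queries = list(known) + ["AUGCAUGCAUGCAUGCAUGCA"]
    for row in design_sirnas(queries, known, 0.7, gc_fraction):
        print(row)
```

Passing the predictor in as a callable keeps the database step independent of the model, so a trained regressor could be substituted for the placeholder without changing the selection logic.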


Similar resources

Improved and automated prediction of effective siRNA.

Short interfering RNAs are used in functional genomics studies to knock down a single gene in a reversible manner. The results of siRNA experiments are highly dependent on the choice of siRNA sequence. In order to evaluate siRNA design rules, we collected a database of 398 siRNAs of known efficacy from 92 genes. We used this database to evaluate previously proposed rules from smaller datasets, a...


Improved nucleic acid descriptors for siRNA efficacy prediction

Although considerable progress has been made recently in understanding how gene silencing is mediated by the RNAi pathway, the rational design of effective sequences is still a challenging task. In this article, we demonstrate that including three-dimensional descriptors improved the discrimination between active and inactive small interfering RNAs (siRNAs) in a statistical model. Five descript...


Production of Cyclin D1 specific siRNAs by double strand processing for gene therapy of esophageal squamous cell carcinoma

Background: RNAi (RNA interference) is a new strategy in gene therapy and biotechnology that holds promise for the treatment of diseases such as cancer and viral diseases. CCND1, a key gene in the cell cycle, is amplified and overexpressed in esophageal cancer. The objective of this study was the production of siRNAs for CCND1. Materials and Metho...


Utilizing Selected Di- and Trinucleotides of siRNA to Predict RNAi Activity

Small interfering RNAs (siRNAs) induce posttranscriptional gene silencing in various organisms. siRNAs targeted to different positions of the same gene show different effectiveness; hence, predicting siRNA activity is a crucial step. In this paper, we developed and evaluated a powerful tool named "siRNApred" with a new mixed feature set to predict siRNA activity. To improve the prediction accur...


Rational design of immunostimulatory siRNAs.

Short-interfering RNAs (siRNAs) have engendered much enthusiasm for their ability to silence the expression of specific genes. However, it is now well established that siRNAs, depending on their sequence, can be variably sensed by the innate immune system through recruitment of toll-like receptors 7 and 8 (TLR7/8). Here, we aimed to identify sequence-based modifications allowing for the design ...



Journal title:
  • Computer methods and programs in biomedicine

Volume 87, Issue 3

Pages: -

Publication date: 2007